Ranked Accuracy and Unstructured Distributed Search

نویسندگان

  • Sami Richardson
  • Ingemar J. Cox
چکیده

Non-uniformly distributing documents in an unstructured peer-to-peer (P2P) network has been shown to improve both the expected search length and search accuracy, where accuracy is defined as the size of the intersection of the documents retrieved by a constrained, probabilistic search and the documents that would have been retrieved by an exhaustive search, normalized by the size of the latter. However neither metric considers the relative ranking of the documents in the retrieved sets. We therefore introduce a new performance metric, rank-accuracy, that is a rank weighted score of the top-k documents retrieved. By replicating documents across nodes based on their retrieval rate (a function of query frequency), and rank, we show that average rank-accuracy can be improved. The practical performance of rank-aware search is demonstrated using a simulated network of 10,000 nodes and queries drawn from a Yahoo! web search log.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Study of Scalable Search Algorithm on Unstructured P2P System (Work-in-Progress)

We proposed a search algorithm to unstructured P2P network, which consists of ranked neighbor caching, queryhit caching, and file replication to free riders. And the simulation results show that the algorithm can extend the search region but reduce the search traffic, and also balance the network load, so that acquires the whole network scalable.

متن کامل

The quality of probabilistic search in unstructured distributed information retrieval systems

Searching the web is critical to the Web’s success. However, the frequency of searches together with the size of the index prohibit a single computer being able to cope with the computational load. Consequently, a variety of distributed architectures have been proposed. Commercial search engines such as Google, usually use an architecture where the the index is distributed but centrally managed...

متن کامل

PlanetP: Using Gossiping to Build Content Addressable Peer-to-Peer Information Sharing Communities

We introduce the PlanetP system, which explores the construction of a content addressable publish/subscribe service using gossiping between peers of an unstructured peerto-peer (P2P) community. Unlike many recent P2P systems that have focused on enabling very large-scale name-based object location, PlanetP does not build and maintain a sophisticated distributed data structure. Instead, PlanetP ...

متن کامل

Optimal reconfiguration of radial distribution system with the aim of reducing losses and improving voltage profiles using the improved lightning search algorithm

In this paper, a modified version of the lightning search algorithm is proposed in order to find the optimal reconfiguration of the switches and locate and determine the optimal capacity of distributed generation sources in the distribution feeder. The main optimization goals are to reduce ohmic losses and voltage deviations in the standard 33-bus and 94-node IEEE feeders. The simulation result...

متن کامل

Heterogeneous Search in Unstructured Peer-to-Peer Networks

Resource search or discovery is a fundamental issue in peer-to-peer (P2P) and grid studies.1 Search objects, or resources, can be cycles, storage spaces, files, services, addresses, and so on. In general, systems are employing three categories of P2P-network architectures to improve search performance: centralized (such as Napster, http://www.napster.com/), decentralized but structured (such as...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013